
Conversation

@ricardoV94
Member

Related to #1806 #1827

- Fix a bug when passing a simple Tensor shape to split_dims
- Change grad_undefined -> grad_disconnected for split_sizes in SplitOp (see #1827 for more context)

```python
):
    # All elements already have the right number of dimensions, so we
    # can just join them directly.
    return join(0, *x)
```

Member Author

This isn't equivalent to stack below?

Member
@jessegrabowski jessegrabowski Jan 9, 2026

No, because stack adds a dimension. This was causing a bug in split_dims where we explicitly ask for ndims=1, passing a sequence of 1d tensors, but then we get back a 2d tensor.
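
A minimal sketch of the difference being described, assuming the current pytensor `stack` and `join` API:

```python
import pytensor.tensor as pt

a = pt.vector("a")
b = pt.vector("b")

# stack introduces a new leading axis, so two 1d inputs become a 2d output
stacked = pt.stack([a, b])
print(stacked.ndim)  # 2

# join concatenates along an existing axis, so the result stays 1d
joined = pt.join(0, a, b)
print(joined.ndim)  # 1
```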

Member Author

Hmm, so my understanding is that this function is supposed to do what np.array(x) would do. I think the ndim is more of an assert: it should fail when the output of np.array (in our case the symbolic equivalent) would yield something different. So in that sense join is never valid, as it keeps the same number of dimensions.
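
For reference, np.array with a list of 1d arrays stacks rather than joins:

```python
import numpy as np

rows = [np.ones(3), np.zeros(3)]
print(np.array(rows).ndim)        # 2 -- a new leading axis is added
print(np.concatenate(rows).ndim)  # 1 -- join-like behaviour keeps ndim
```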

I want to revert and check if I'm missing something with the test that was failing.

Member

Sure. From my perspective the biggest issue is that as_tensor_variable(..., ndims=1) isn't idempotent -- sequential calls on the same input keep mutating the same graph. This is happening because of stack.
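
A hedged illustration of the non-idempotence being reported (behaviour before the revert, as I read this thread; the keyword in `as_tensor_variable`'s signature is `ndim`):

```python
import pytensor.tensor as pt

x = pt.vector("x")
y1 = pt.as_tensor_variable(x, ndim=1)
y2 = pt.as_tensor_variable(y1, ndim=1)

# Idempotence would mean y2 is y1 (or at least an identical graph);
# the report here is that each call wrapped its input in another node.
print(y1 is x, y2 is y1)
```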

Member Author
@ricardoV94 ricardoV94 Jan 9, 2026

That's odd, because if it's already a single tensor variable (and not a list with one in it) it shouldn't do anything.

Member Author
@ricardoV94 ricardoV94 Jan 9, 2026

Yeah that first one seems wrong.

Even if we fix it, I think our check for "sequence" in split_dims (or wherever the problem was) should be more like `isinstance(x, Sequence) or (isinstance(x, TensorVariable) and x.ndim == 1)`.

1d numpy arrays should also be valid, but maybe those pass the Sequence instance check.
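
Something along these lines (a hypothetical helper, not the actual split_dims code):

```python
from collections.abc import Sequence

import numpy as np
from pytensor.tensor.variable import TensorVariable


def is_sizes_sequence(x):
    """Return True if x should be treated as a sequence of split sizes."""
    return (
        isinstance(x, Sequence)
        # np.ndarray is not a collections.abc.Sequence, so check it explicitly
        or (isinstance(x, np.ndarray) and x.ndim == 1)
        or (isinstance(x, TensorVariable) and x.ndim == 1)
    )
```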

Member Author

Maybe we should remove the ndim argument altogether? numpy doesn't have it and I don't think we need it.

I thought it was just used for validation, but it seems to affect non-raising outcomes.

Member

I'm +1 for removing it. I never knew it existed, and it seems like it's overloading the function.

If I had to guess though, it's exactly for this situation. We have an argument whose type is `int | Variable | tuple[int | Variable]`. The Variable, though, can be either a scalar or an array, so really the typing is something like `int | Variable[ndim=0] | Variable[ndim=1] | tuple[int | Variable[ndim=0]]`. When we do the `if not isinstance(shape, tuple): shape = (shape,)` we're ignoring the `Variable[ndim=1]` case. Calling `as_tensor_variable(tuple[Variable[ndim=0]]) -> Variable[ndim=1]` makes sense to me, and matches the numpy behavior. In this case we're counting on the `ndim=1` argument to guard against the case of `as_tensor_variable(tuple[Variable[ndim=1]]) -> Variable[ndim=2]`.

Typing all this out, it seems like an abuse of the as_tensor_variable function.
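
Spelling this out as an annotation (illustrative only; `ShapeLike` is a hypothetical alias, Python 3.10+ union syntax, and the ndim distinction cannot actually be expressed, which is exactly the problem):

```python
from pytensor.tensor.variable import TensorVariable

# Conceptually:
#   int | TensorVariable[ndim=0] | TensorVariable[ndim=1] | tuple[int | TensorVariable[ndim=0], ...]
# but since ndim is not part of the static type, the best we can write today is:
ShapeLike = int | TensorVariable | tuple[int | TensorVariable, ...]
```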

Member Author
@ricardoV94 ricardoV94 Jan 9, 2026

Yeah agreed. Would be really nice to be able to have those `TensorVariable[ndim=0]` types btw. Need to nerdsnipe some type-hint lovers.

@jessegrabowski
Member

I reverted the changes to as_tensor_variable. At minimum it's out of scope for this PR. Implementing more careful checks of the shape argument (based on the analysis in the comment above) was sufficient to clear the test failures. We can revisit the ndims argument later.

Something else I noticed was that we're passing dtype to as_tensor_variable. This doesn't do anything in the Variable case, so I changed it to an explicit cast inside the Op's make_node (I left it in the wrapper to handle the Sequence case).
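
Roughly the change described (a sketch, not the exact diff), assuming the shape argument arrives as a symbolic vector:

```python
import pytensor.tensor as pt

shape = pt.vector("shape")  # stands in for a user-supplied symbolic shape

# dtype passed to as_tensor_variable is a no-op for an existing Variable,
# so the conversion and the cast are done as two explicit steps:
shape = pt.as_tensor_variable(shape)
shape = pt.cast(shape, "int64")
```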

@ricardoV94
Member Author

No, better not to cast variables in make_node, but to raise like before. That's what shape ops always do. If a user passes a float as a shape argument, it's likely a bug and this would mask it.
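
A sketch of the "raise instead of cast" check being asked for (hypothetical code, not the actual Op):

```python
import numpy as np
import pytensor.tensor as pt


def check_shape_dtype(shape):
    """Reject non-integer shape arguments instead of silently casting them."""
    shape = pt.as_tensor_variable(shape)
    if not np.issubdtype(np.dtype(shape.dtype), np.integer):
        raise TypeError(f"shape must have an integer dtype, got {shape.dtype}")
    return shape
```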

@jessegrabowski
Member

Someday I will merge a PR

```python
)

if not shape:
if empty_shape:
```

Member Author
@ricardoV94 ricardoV94 Jan 10, 2026

What about just `shape.type.shape == (0,)` for the variable case? Also, if you standardize as_tensor_variable, you don't need the variable vs non-variable case.
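
For illustration, the static shape check on a symbolic vector (assuming pytensor's `shape` keyword on the tensor constructors):

```python
import pytensor.tensor as pt

s = pt.vector("shape", dtype="int64", shape=(0,))  # statically known to be empty
print(s.type.shape == (0,))  # True

t = pt.vector("shape", dtype="int64")              # length unknown at graph build time
print(t.type.shape == (0,))  # False -- type.shape is (None,)
```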

Member Author

But also, do we need the special squeeze branch, or would the Op do the right thing anyway?

Member

Tests pass without it (as long as I adjust the existing test_split_size_zero_shape test to pass dtype int to the shape argument), so I guess not.

@ricardoV94
Member Author

I'm happy with the PR. I'll fix the git history and merge
